Running Head: CORRELATION COEFFICIENT NONNORMALITY 1 Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality

نویسندگان

  • Anthony J. Bishara
  • James B. Hittner
چکیده

It is more common for educational and psychological data to be non-normal than to be approximately normal. This tendency may lead to bias and error in point estimates of the Pearson correlation coefficient. In a series of Monte Carlo simulations, the Pearson correlation was examined under conditions of normal and non-normal data, and it was compared to its major alternatives, including the Spearman rank-order correlation, the bootstrap estimate, the Box-Cox transformation family, and a generally normalizing transformation (i.e., rankit), as well as to various bias adjustments. Non-normality caused the correlation coefficient to be inflated by up to +.14, particularly when the non-normality involved heavy-tailed distributions. Traditional bias adjustments worsened this problem, further inflating the estimate. The Spearman and rankit correlations eliminated this inflation and provided conservative estimates. Rankit also minimized random error for most sample sizes, except for the smallest samples (n = 10), where bootstrapping was more effective. Overall, results justify the use of carefully chosen alternatives to the Pearson correlation when normality is violated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bias and Precision of the Squared Canonical Correlation Coefficient Under Nonnormal Data Condition

Monte Carlo methods were employed to investigate the effect of nonnormality on the bias associated with the squared canonical correlation coefficient (Rc 2). The majority of Rc 2 estimates were found to be extremely biased, but the magnitude of bias was impacted little by the degree of nonnormality.

متن کامل

The Robustness of Test Statistics to Nonnormality and Specification Error in Confirmatory Factor Analysis

Monte Carlo computer simulations were used to investigate the performance of three X 2 test statistics in confirmatory factor analysis (CFA). Normal theory maximum likelihood )~2 (ML), Browne's asymptotic distribution free X 2 (ADF), and the Satorra-Bentler rescaled X 2 (SB) were examined under varying conditions of sample size, model specification, and multivariate distribution. For properly s...

متن کامل

Conditions for Robustness to Nonnormality on Test Statistics in a Gmanova Model

This paper presents the conditions for robustness to the nonnormality on three test statistics for a general multivariate linear hypothesis, which were proposed under the normal assumption in a generalized multivariate analysis of variance (GMANOVA) model. The proposed conditions require the cumulants of an unknown population’s distribution to vanish in the second terms of the asymptotic expans...

متن کامل

The Efficiency of the Asymptotic Expansion of the Distribution of the Canonical Vector under Nonnormality

In canonical correlation analysis, canonical vectors are used in the interpretation of the canonical variables. We are interested in the asymptotic representation of the expectation, the variance and the distribution of the canonical vector. In this study, we derive the asymptotic distribution of the canonical vector under nonnormality. To obtain the asymptotic expansion of the canonical vector...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014